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METHODS OF MODIFYING THE PRODUCTION OF ISOPENTENYL 
PYROPHOSPHATE, DIMETHYLALLYL PYROPHOSPHATE AND/OR 

ISOPRENOIDS 

BACKGBQ1 1ND OF THF INVENTION 

5 Fiftl H nf thfi Invention 

The present invention is directed to genes encoding deoxyxylulose-5- 
phosphate synthase (dxps), glyceraldehyde-3-phosphate dehydrogenase (gapd), 
and the lytB gene product, as well as vectors containing the same and hosts 
transformed with said vectors. The present invention also provides methods for 

10 modifying the production of isopentenyl pyrophosphate (IPP) and/or dimethylallyl 
pyrophosphate (DMAPP) and/or an isoprenoid compound (i.e., a compound derived 
from IPP and/or DMAPP). Additionally, the present invention provides a method 
for screening for procaryotic and eukaryotic genes encoding enzymes that 
participate in the nonmevalonate pathway leading to IPP and DMAPP. 

15 Back ground of the Invention 

A plethora of chemical compounds are produced by what are collectively 
known as the pathways of isoprenoid (a.k.a. terpene) biosynthesis. The one feature 
in common to the many isoprenoids (more than 20,000 have been identified in 
plants; Chappell, 1995) is their biosynthesis from the central metabolites and 

20 building blocks for all isoprenoid compounds: the 5 carbon compounds isopentenyl 
pyrophosphate (IPP) and its allylic isomer dimethylallyl pyrophosphate (DMAPP). 
The interconversion of IPP and DMAPP is reversible, and is catalyzed by the 
enzyme IPP isomerase (IPI). Using the IPP and DMAPP building blocks in various 
combinations, a modular assembly process that produces compounds of 5, 10, 15, 

25 20 or more carbons (in multiples of 5) allows the biosynthesis of the basic skeletons 
for the many and various isoprenoids with a relatively small number of basic 
reaction steps. The skeleton of plant carotenoid pigments, for instance, is 
assembled from two molecules of a compound, geranylgeranyl pyrophosphate 
(GGPP), that is itself assembled from 3 units of IPP and 1 unit of DMAPP, and that 

30 also serves as a precursor for many other branches of the isoprenoid pathway in 
plants. 

Isoprenoids are formed from IPP and DMAPP in at least three different 

■ 

i 
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compartments of plants cells: the cytosol/endoplasmic reticulum, the mitochondria 
(and/or Golgi apparatus), and the plastids (McGarvey and Croteau, 1995). The 
source of IPP and DMAPP for isoprenoid biosynthesis in these different 
compartments has long been a matter of some controversy and debate (see Bach, 

5 1995; McGarvey and Croteau, 1995). The well-known "classical" or 
acetate/mevalonate route to IPP and DMAPP proceeds from acetyl-CoA via 3- 
hydroxy-3-methylglutaryl-CoA (HMG-CoA) and mevalonic acid (MVA). The critical, 
rate-determining and irreversible step in this route in animals is the reduction of 
HMG-CoA to produce MVA, catalyzed by the enzyme HMG-CoA reductase 

10 (HMGR; EC 1 .1 .1 .34; reviewed in Goldstein and Brown, 1 990). This enzyme also 
appears to provide an important control point for substrate flow into certain 
isoprenoids in plants (Bach, 1986; Chappell et ah, 1995; Schaller et al. v 1995; 
Stermer et al., 1994). HMGR has been localized in the cytosol of plants, 
associated with membranes of the endoplasmic reticulum (reviewed in Bach, 1995). 

15 There is no convincing experimental evidence that this key enzyme also resides in 
the plastids or mitochondria of plants. An uptake by plastids of IPP to support 
isoprenoid biosynthesis in this organelle has been suggested in the past (Kreuz and 
Kleinig, 1984). However, it has been difficult to reconcile a cytosolic source of IPP 
for plastid isoprenoids with the effective inhibition of cytoplasmically-made 

20 isoprenoids (e.g. sterols) but not of plastid-made isoprenoids (e.g. carotenoids) or 
mitochondrial ones (quinones) by mevinolin, a specific inhibitor of HMG-CoA 
reductase (discussed in Bach, 1995 and McGarvey and Croteau, 1995). 
Furthermore, the demonstrated ability of isolated chloroplasts of Acetabularia 
(Moore and Sheperd, 1977) and spinach (Schulze-Seibert and Schultz, 1987) to 

25 synthesize carotenoids and other isoprenoids from 14 C0 2 necessarily implies a 
chloroplast pathway for IPP biosynthesis. Further confusion arose from reports that 
radiolabeled acetate Is efficiently incorporated by isolated chloroplasts into fatty 
acids but not into carotenoids (see Kleinig, 1 989; Liedvogel, 1 986; Schulze-Seibert 
and Schultz, 1987), a puzzling observation that has been explained by invoking a 

30 metabolic "channeling" that somehow renders the newly-formed acetyl-CoA pool 
available for fatty acid biosynthesis but unavailable for IPP biosynthesis (Kleinig, 
1989). 

2 
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An attractive explanation for these many puzzling and apparently 
contradictory observations is provided by the suggestion of an "alternative" or non- 
mevalonate pathway for the biosynthesis of IPP and/or DMAPP in the chloroplasts 
of plants (Lichtenthaler et ah, 1997) and in algae (Schwender et al., 1996). 

5 Existence of a distinct non-mevalonate biochemical pathway to IPP and/or DMAPP, 
using pyruvate rather than acetate as a substrate, was posited for certain bacteria, 
including Escherichia coli, as early as 1981 (Pandian et a/. f 1981), but this 
alternative pathway in bacteria received little attention until relatively recently (Zhou 
and White, 1991; Rohmer et al., 1993 and Horbach et al., 1993; see Bach, 1995 

10 for a critical discussion of this topic). It appears that many bacteria do not have the 
acetate/mevalonate pathway for biosynthesis of IPP. For instance, a BLAST 
search of the £. coli genome, the entire sequence for which ( has now been 
deposited in GenBank, does not identify any open reading frames with significant 
similarity to known bacterial, plant or mammalian HMG-CoA reductases. Nor can 

15 we discern an HMG-CoA reductase homologue in the genome sequence data base 
of the cyanobacterium Synechococystis 6803. Because ancestors of the 
cyanobacteria are the presumptive progenitors of plant chloroplasts, the non- 
mevalonate route to IPP in these photosynthetic procaryotes is likely to resemble 
that in plant chloroplasts. 

20 Recent reports indicate that plant and algal chloroplast isoprenoids, including 

carotenoids (Lichtenthaler et al., 1997; Schwender et al., 1996), and the medically- 
important isoprenoid paclitaxel (a.k.a. TAXOL®) (Eisenreich et al., 1996) are made 
from IPP supplied via a non-mevalonate pathway. Whether this pathway supplies 
IPP for carotenoids and other isoprenoids manufactured in the various 

25 nonphotosynthetic plant plastids (e.g. chromoplasts, leucoplasts, amyloploasts) or 
in etioplasts or developing chloroplasts is an open question. Several recent reports 
describe a thiamin-dependent enzyme from E. coli (Sprenger et a/., 1997; Lois et 
a/., 1998) and from peppermint (Lange et a/., 1998) that converts pyruvate and 
glyceraldehyde-3-phosphate (GAP) to deoxyxylulose-5-phosphate (DXP). A 

30 homologue of this enzyme was earlier described for Arabidopsis thaliana (Mandel 
et a/., 1996), but the function of the A. thaliana polypeptide had not been 
ascertained. 

3 
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The evidence is compelling that DXP is a substrate in the pathway leading 
to IPP or DMAPP in plants, algae, and certain bacteria including E. coli (Arigoni et 
al M 1 997). However, DXP also is a substrate leading to other essential compounds 
in these organisms, including thiamin and pyridoxal (see Lois et aL, 1998). 
Therefore the biochemical reaction catalyzed by the DXP synthase enzyme is not 
dedicated or restricted to providing substrate for the pathway or pathways leading 
to IPP and/or DMAPP, and isoprenoids derived therefrom. Because the product 
of the reaction is shared with several other pathways, the DXP synthase enzyme 
would not seem to be an obvious candidate for a controlling step in the pathway 
leading to isoprenoids. An ability to modify isoprenoid production, enhancing 
production of desired compounds or reducing that of undesirable compounds, 
would be quite advantageous in many applications. 

Plausible routes to IPP from pyruvate and GAP via DXP have been 
suggested for plants and bacteria (Schwender et a/., 1996; Rohmer et a/., 1993 and 
1996; see Bach, 1995). A gene encoding what may be the second enzyme in the 
pathway, DXP reductoisomerase, has recently been identified in Escherichia coli 
(Takahashi et ah, 1998). The precise nature and sequence of subsequent 
biochemical reactions in the nonmevalonate pathway, and whether regulation of 
flux through this pathway occurs primarily at a single controlling step, as with HMG- 
CoA reductase for the mevalonate pathway, are so far unknown. We have taken 
a genetic approach to delineate the pathway leading to IPP and DMAPP in £. coli 
and plants. 

We earlier made the surprising observation that the reversible 
interconversion of IPP and DMAPP limits the accumulation of carotenoids in E. coli 
(an observation alluded to in Sun et aL, 1996 and Sun et aL, 1998). Others 
(Kajiwara et aL, 1997) have made a similar observation. As a direct result of 
continuing to pursue the same empirical approach (using a colored isoprenoid 
compound to "report" on the amount of IPP and DMAPP available for isoprenoid 
biosynthesis) which revealed that the isomerization of IPP is a limiting step in 
isoprenoid biosynthesis (see U.S. Patent No. 5,744,341), we have now identified 
several other genes which can be used to modify IPP, DMAPP and/or isoprenoid 
production in host cells. 



* 



WO 99/58649 



PCT/US99/07041 



For two of the these genes the functions of the products, glyceraldehyde-3- 
phosphate dehydrogenase and DXP synthase, are well known. The role of the 
product of the third, the lytB gene, is still obscure. Because certain mutations in 
this gene in Escherichia coli confer an ability to tolerate elevated levels of penicillin 

5 (Harkness et al., 1 992; Gustafson et al., 1 993) the lytB gene product would seem 
to play some role in or otherwise influence the biosynthesis of peptidoglycan. It has 
been suggested that the lytB gene product is not an enzyme directly involved in the 
pathway leading to peptidoglycan but rather exerts some control on the production 
of the global regulator molecule guanosine 3\5'-bisphosphate (ppGpp) (Gustafson 

10 et al., 1993; Rodionov and Ishiguro, 1995). The evidence in support of this 
suggestion is correlative and indirect and therefore by no means convincing. 
Because lytB influences the accumulation of certain isoprenoids in £. coli (see 
below), because an analysis of the nearly twenty completed bacterial genomes 
reveals it to be one of the very few genes found only in those bacteria that also 

15 contain genes encoding DXP synthase and DXP reductoisomerase, and because 
we have isolated an apparently chloroplasMargeted lytB cDNA from a eucaryotic 
plant, we speculate that lytB may encode an enzyme that catalyzes one of the later 
reactions in the nonmevalonate pathway leading to IPP and DMAPP in certain 
bacteria and in plant chloroplasts. To avoid confusion, it is important to note that 

20 another gene, which is completely unrelated to the gene under consideration here, 
has unfortunately also been given the name lytB (see, e.g., Garcia et al., 1999). 

There remains a need for methods to enhance the production of desirable 
isoprenoids and reduce the accumulation of undesirable isoprenoids in plants, fungi 
and bacteria that contain a nonmevalonate pathway. There also remains a need 

25 in the art for methods for screening for procaryotic and eukaryotic genes encoding 
enzymes of isoprenoid biosynthesis and metabolism. 

SUMMARY QFTHF INVFNTIQN 
A subject of the present invention is an isolated nucleic acid sequence which 
encodes for a protein having DXP synthase enzyme activity, wherein the nucleic 
30 acid sequence is at least 85% identical to SEQ ID NO: 1 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 

5 
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85% identical to SEQ ID NO: 2. 

Another subject of the present invention is an isolated nucleic acid sequence 
which encodes for a protein having GAP dehydrogenase enzyme activity, wherein 
the nucleic acid sequence is at least 85% identical to SEQ ID NO: 3 or the nucleic 

5 acid sequence encodes a protein which has an amino acid sequence which is at 
least 85% identical to SEQ ID NO: 4. 

A further subject of the present invention is an isolated nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 5 or the nucleic acid 

10 sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 6. 

The invention also includes vectors which comprise any of the nucleic acid 
sequences listed above, and host cells transformed with such vectors. 

Another subject of the present invention is a method of enhancing the 

15 production of IPP, DMAPP and/or an isoprenoid compound in a host cell, 
comprising inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein having DXP synthase, GAP 
dehydrogenase and/or LYTB activity, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and expressing the heterologous nucleic 

20 acid sequence, thereby producing the protein. 

Yet another subject of the present invention is a method of modifying the 
production of IPP, DMAPP and/or an isoprenoid compound in a host cell, 
comprising inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein which modifies DXP synthase, GAP 

25 dehydrogenase and/or LYTB activity in the host cell, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter; and expressing the 
heterologous nucleic acid sequence, thereby producing the protein. 

The present invention also includes a method of expressing, in a host cell, 
a heterologous nucleic acid sequence which encodes for a protein having DXP 

30 synthase, GAP dehydrogenase and/or LYTB activity, the method comprising 
inserting into the host cell a vector -comprising the heterologous nucleic acid 
sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
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promoter, and expressing the heterologous nucleic acid sequence. 

Also included is a method of expressing, in a host cell, a heterologous 
nucleic acid sequence which encodes for a protein which modifies DXP synthase, 
GAP dehydrogenase and/or LYTB activity in the host cell, the method comprising 
inserting into the host cell a vector comprising the heterologous nucleic acid 
sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter; and expressing the heterologous nucleic acid sequence. 

Another subject of the present invention is to provide a method for screening 
for eukaryotic genes which encode enzymes involved in isoprenoid biosynthesis 
and metabolism. 

RRtFF DFSCR IPTION OF THF DRAWINGS 

A more complete appreciation of the invention and many of the attendant 
advantages thereof will be readily obtained as the same becomes better 
understood by reference to the following detailed description when considered in 
connection with the accompanying drawings, wherein: 

Figure 1 is an illustration showing that IPP and DMAPP serve as the central 
metabolites leading to an immense variety of different isoprenoid compounds in 
plants, bacteria, and other organisms. The mevalonic acid pathway via acetyl-CoA 
in the cytosol of plants and in animals is well characterized. A second route from 
pyruvate and GAP, not yet elucidated, is thought to operate in the chloroplasts of 
plants and algae, in cyanobacteria, and in many bacteria. Later reaction steps in 
the pathway remain to be determined and are therefore denoted with question 
marks. The pathway is shown leading to a box containing IPP and DMAPP 
because it has not been established which of these two is the initial product of this 
pathway. Abbreviations: G3P, glyceraldehyde-3-phosphate; G3PD, GAP 
dehydrogenase; DXPS, DXP synthase; IPP, isopentenyl pyrophosphate; I PI, IPP 
isomerase; DMAPP, dimethylallyl pyrophosphate; FPP, farnesyl pyrophosphate; 
GPP geranyl pyrophosphate; GGPP, geranylgeranyl pyrophosphate. 

Figure 2 illustrates a route to IPP/DMAPP from pyruvate and glyceraldehyde-3- 
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phosphate (GAP), which is thought to operate in the chloroplasts of plants and 
algae, in cyanobacteria, and in many bacteria including £ coll Later reaction steps 
in the pathway remain to be determined and are therefore denoted with question 
marks. The pathway is shown leading to a box containing IPP and DMAPP 
5 because it has not been established which of these two is the initial product of this 
pathway. Enzymes of interest are shown in bold white text in a black box. 
Abbreviations: DMAPP, dimethylallyl pyrophosphate; DXPS, deoxyxylulose-5- 
phosphate synthase; DXPR, deoxyxylulose-5-phosphate reductoisomerase; FPS, 
farnesyl pyrophosphate synthase, GAPD, glyceraldehyde-3-phosphate 
10 dehydrogenase; GGPP, geranylgeranyl pyrophosphate; IPI, isopentenyl 
pyrophosphate isomerase; IPP, isopentenyl pyrophosphate. 

Figure 3 illustrates that the carotenoid phytoene is derived by a head-to-head 
condensation of two molecules of the GGPP compound, which itself is 
assembled from 3 molecules of IPP and 1 molecule of DMAPP. Abbreviations: 
15 FPP, farnesyl pyrophosphate; GPP geranyl pyrophosphate; PPPP, prephytoene 
pyrophosphate. 

Figure 4 is a schematic illustration and restriction map of plasmid pAC-LYC 
(Cunningham et al., 1994) which contains genes (crtE, crtB, and crtl) of Erwinia 
herbicola encoding all of the enzymes required for production of the pink-colored 
20 isoprenoid pigment lycopene from the colorless IPP and DMAPP compounds. Cm, 
chloramphenicol resistance gene. 

Figure 5A is a cDNA sequence and Figure 5B is the predicted amino acid sequence 
of a putative DXP synthase isolated from a flower cDNA library of Tagetes erecta 
(SEQ ID NOS: 1 and 2). The cDNA is incorporated into the plasmid pMarDXPS. 

25 Figure 6A is a cDNA sequence and Figure 6B is the predicted amino acid sequence 
of a chloroplast isoform of GAP dehydrogenase isolated from Arabidopsis thaliana 
(SEQ ID NOS: 3 and 4). The cDNA is incorporated into the plasmid pAtG3PD. 

■ 
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Figure 7A is a cDNA sequence and Figure 7B is the predicted amino acid sequence 
of a LYTB protein isolated from an Adonis palaestina flower cDNA library (SEQ ID 
NOS: 5 and 6). The sequence is of clone Ipi3 except that the 14 bp at the N- 
terminus were obtained from the slightly longer cDNA clone Ipi18. The cDNA is 
incorporated into the plasmid pApLYTB. 

Figure 8 is an alignment of the predicted amino acid sequences of LYTB from 
Adonis palaestina, Synechocystis PCC6803 and E. coll The N-terminal extension 
of the Adonis polypeptide, relative to that of Synechocystis PCC6803, is predicted 
by the program ChloroP (Emanuelsson et al., 1999) to constitute a chloroplast 
transit peptide, serving to target the polypeptide to this organelle in plants. Black 
boxes with white letters are used where all three of the aligned residues are 
identical. Grey boxes with black letters are used where two of the three aligned 
residues are identical. Abbreviations: Ap, Adonis palaestina; Sy, Synechocystis 
PCC6803\ Ec, Escherichia coil 

Figure 9 is a schematic representation of the mapping of an £. coli genomic 
fragment to ascertain which of the genes in this fragment will enhance or impair 
lycopene accumulation in E. coll Deletion mapping of an E. coli genomic fragment 
(the insert in plasmid pEc3.9 is essentially identical to GenBank U32768: 
4879..8819) with genes encoding DXP synthase (orf620) f FPP synthase (ispA) and 
the small subunit of exonuclease VII (xseB). Numbers to the right indicate relative 
lycopene accumulation per mL of liquid culture with plasmids in lycopene- 
accumulating E. coli strain TOP10. The insert in pEc3.9 is oriented in the forward 
direction in the multicopy plasmid vector pBluescript SK-. The EcoRl and Smal 
sites in the vector preceding the genomic fragment and a Kpn\ site following it were 
used, along with A/del, Smal, and Sail sites in the genomic DNA, to construct the 
deletion subclones illustrated. Incomplete genes and open reading frames are not 
shown. 

nFsnmPTiON of thf prfffrrfd EMBOD IMENTS 
The term M isoprenoid M is intended to mean any member of the class of 
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naturally occurring compounds whose carbon skeletons are composed, in part or 
entirely of isopentyl C 5 units. Preferably, the carbon skeleton is of an essential oil, 
a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxel. 

A search of the public databases (GenBank) for genes encoding 
homologues of the £. coli lytB gene product unearthed such genes in twelve 
bacteria and in the cyanobacterium Synechocystis PCC6803. A recent publication 
(Potter et al., 1998) discusses the distribution of this gene in various bacteria. Our 
description herein of a homologue in the plant Adonis palaestina reveals for the first 
time that this gene is present and expressed in a eucaryotic organism. A recently 
deposited genomic DNA sequence for the green plant Arabidopsis thaliana 
(GenBank accession number AL035521 ) contains what appears to be a gene (the 
probable coding sequence is interrupted by several apparent introns) encoding 
LYTB in this organism. The predicted sequence of this Arabidopsis LYTB is 
somewhat uncertain (a comparison of the Adonis sequence with that listed given 
in AL035521 for Arabidopsis suggests that several of the exon-intron junctions 
predicted for this gene in the GenBank record are incorrect), but sequence identity 
in a comparison with the Adonis sequence is ca. 80% or more. Partial cDNA 
sequences in the data base of expressed sequence tags (dbEST) that predict 
peptides with sequence similarity to portions of the Adonis LYTB sequence indicate 
that homologues exist and mRNAs encoding LYTB are produced in several other 
plant species including rice (D45948), loblolly pine (AA556723) and soybean 
(AI437981 ). Both the Adonis and Arabidopsis predicted amino acid sequences are 
more than 60% identical to that predicted by the cyanobacterium Synechocystis 
PCC6803 gene, and the two plant and the cyanobacterial sequences are more than 
30% identical to the predicted £. coli gene product. An alignment of the Adonis, 
Synechocystis PCC6803 and E. coli predicted amino acid sequences is shown in 
Figure 7. A number of regions and residues conserved in LYTB are indicated in 
this Figure. 

The term "LYTB activity- is intended to mean the ability of LYTB to affect the 
production of IPP, DMAPP and/or isoprenoids in a host cell containing the lytB 
gene or DNA copy of the lytB mRNA. It has not yet been confirmed that the LYTB 
protein is, in fact, an enzyme. The precise role of the LYTB protein in affecting the 

10 
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production of isoprenoids has not been established. However, whatever the 
mechanism of action of the lytB gene product, we intend to cover that mechanism 
of action by the term "LYTB activity". 

The present invention is directed to an isolated nucleic acid sequence which 
5 encodes for a protein having DXP synthase enzyme activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 1 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 2. Preferably, the nucleic acid sequence is at least . 
90%, at least 95% or completely identical to SEQ ID NO: 1, or the nucleic acid 
10 sequence encodes a protein which has an amino.acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 2. 

Another subject of the present invention is an isolated nucleic acid sequence 
which encodes for a protein having GAP dehydrogenase enzyme activity, wherein 
the nucleic acid sequence is at least 85% identical to SEQ ID NO: 3 or the nucleic 
15 acid sequence encodes a protein which has an amino acid sequence which is at 
least 85% identical to SEQ ID NO: 4. Preferably, the nucleic acid sequence is at 
least 90%, at least 95% or completely identical to SEQ ID NO: 3, or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 4. 
20 A further subject of the present invention is an isolated nucleic acid 

sequence which encodes for a protein having LYTB activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 5 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 6. Preferably, the nucleic acid sequence is at least 
25 90%, at least 95% or completely identical to SEQ ID NO: 5, or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 6. 

In each case, sequence similarity is measured using sequence analysis 
software, for example, the Sequence Analysis software package of the Genetics 
30 Computer Group (University of Wisconsin Biotechnology Center, 1710 University 
Avenue, Madison, Wisconsin 53705), MEG Align (DNAStar, Inc., 1228 S. Park St., 
Madison, Wisconsin 53715), or MacVector (Oxford Molecular Group, 2105 S. 
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Bascom Avenue, Suite 200, Campbell, California 95008). Such software matches 
similar sequences by assigning degrees of identity to various substitutions, 
deletions, and other modifications. Conservative (i.e. similar) substitutions typically 
include substitutions within the following groups: glycine and alanine; valine, 
isoleucine and leucine; aspartic acid, glutamic acid, asparagine and glutamine; 
serine and threonine; lysine and arginine; and phenylalanine and tyrosine. 
Substitutions may also be made on the basis of conserved hydrophobicity or 
hydrophilicity (see Kyte and Doolittle, J. Mol. Biol. 157: 105-132 (1982)), or on the 
basis of the ability to assume similar polypeptide secondary structure (see Chou 
and Fasman, Adv. EnzymoL 47: 45-148 (1978)). If comparison is made between 
nucleotide sequences, preferably the length of comparison sequences is at least 
50 nucleotides, more preferably at least 60 nucleotides, at least 75 nucleotides or 
at least 100 nucleotides. It is most preferred if comparison is made between the 
nucleic acid sequences encoding the protein coding regions necessary for protein 
activity. If comparison is made between amino acid sequences, preferably the 
length of comparison is at least 20 amino acids, more preferably at least 30 amino 
acids, at least 40 amino acids or at least 50 amino acids. It is most preferred if 
comparison is made between the amino acid sequences in the protein coding 
regions necessary for protein activity. 

The present inventors have isolated eukaryotic genes encoding DXP 
synthase from Tagetes erecta (marigold), GAP dehydrogenase from Arabidopsis 
thaliana, and LYTB from Adonis palaestina. All were identified on the basis of an 
enhancement of lycopene accumulation in E. coil The E coli DXP synthase was 
also identified in this same way. 

Suitable vectors according to the present invention comprise a gene 
encoding one or more of the above-identified enzymes involved in IPP, DMAPP 
and/or isoprenoid biosynthesis or metabolism, wherein the gene is operably linked 
to a suitable promoter. Suitable promoters for the vector can be constructed using 
techniques well known in the art (see, for example, Sambrook et aL, Molecu l ar 
Cloning A Laboratory Manual . Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY, 1989; Ausubel et aL, Current Prot nrnk in Molecular Biology. Greene 
Publishing and Wiley Interscience, New York, 1991). Suitable vectors for 
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eukaryotic expression in plants are described in Frey et al.. Plant J. (1995) 8(5):693 
and Misawa et al, 1994a. Suitable vectors for prokaryotic expression include 
pACYC184, pUC1 19, and pBR322 (available from New England BioLabs, Bevery, 
MA) and pTrcHis (Invitrogen) and pET28 (Novagen) and derivatives thereof. The 

5 vectors of the present invention can additionally contain regulatory elements such 
as promoters, repressors, selectable markers such as antibiotic resistance genes, . 
etc., the construction of which is very well known in the art. 

One or more of the genes encoding the enzymes as described above, when 
cloned alone or in combination into a suitable expression vector, can be used to 

10 overexpress these enzymes in a plant expression system or to inhibit the 
expression of these enzymes. For example, a vector containing one or more of the 
genes of the invention may be used to increase the amount of isoprenoids in an 
organism and thereby alter the nutritional or commercial value or pharmacology of 
the organism. 

15 Therefore, the present invention includes a method of enhancing the 

production of IPP, DMAPP and/or an isoprenoid in a host cell, relative to an 
untransformed host cell, the method comprising inserting into the host cell a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein 
having DXP synthase, GAP dehydrogenase and/or LYTB activity, wherein the 

20 heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence. 

The invention also includes a method of modifying the production of IPP, 
DMAPP and/or an isoprenoid in a host cell, the method comprising inserting into 
the host cell a vector comprising a heterologous nucleic acid sequence which 

25 encodes for a protein which modifies DXP synthase, GAP dehydrogenase and/or 
LYTB activity in the host cell, relative to an untransformed host cell, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

The invention further includes a method of expressing, in a host cell, a 

30 heterologous nucleic acid sequence which encodes for a protein having DXP 
synthase, GAP dehydrogenase and/or LYTB activity, the method comprising 
inserting into the host cell a vector comprising the heterologous nucleic acid 
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sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter, and expressing the heterologous nucleic acid sequence. 

The invention also includes a method of expressing, in a host cell, a 
heterologous nucleic acid sequence which encodes for a protein which modifies 

5 DXP synthase, GAP dehydrogenase and/or LYTB activity in the host cell, relative 
to an untransformed host cell, the method comprising inserting into the host cell a 
vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

10 Preferably, the isoprenoid comprises a compound derived from at least one 

member selected from the group consisting of geranyl pyrophosphate (GPP), 
famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). By the 
term "derived from", we mean that the isoprenoid contains at least one of the listed 
compounds as a building block. It is also preferable if the isoprenoid comprises at 

15 least one member selected from the group consisting of a diterpene, a carotenoid, 
an essential oil, a fragrance, an isoprene, a cytokinin, a rubber, a quinone, a sterol, 
a hopanoid, a triterpene, a steroid, a prenylated protein, a phytoalexin, a gibberellin, 
a tocopherol, a dolichol, a chlorophyll and a therapeutic compound. 

It is most preferred if the isoprenoid comprises at least one member selected 

20 from the group consisting of an essential oil, a fragrance, a rubber, a carotenoid 
and a therapeutic compound, such as paclitaxel. 

The heterologous nucleic acid sequence may originate from a eukaryotic or 
procaryotic cell. By the term "originate from", we intend to mean that the sequence 
information for the heterologous nucleic acid came from the eukaryotic or the 

25 procaryotic cell. However, the specific nucleic acid itself does not have to be from 
the organism. The nucleic acid may come from the organism, or it may be 
synthetically produced using recombinant nucleic acid techniques known in the art. 

Preferably, the heterologous nucleic acid sequence comprises a nucleotide 
sequence for dxps, gapd and/or lytB. It is most preferred that the heterologous 

30 nucleic acid sequence comprises a nucleotide sequence which encodes a dxps, 
gapd and/or a lytB gene and is at least 85% identical, preferably at least 90%, at 
least 95% or completely identical, to SEQ ID NO: 1 , 3 and/or 5, respectively, or the 
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nucleic acid sequence encodes a protein which has an amino acid sequence which 
is at least 90%, at least 95% or completely identical to SEQ ID NO: 2, 4 and/or 6, 
respectively. Identity is determined as noted above. 

The term "modifying the production" in the methods of the invention means 
that the amount of target compounds produced (e.g., IPP, DMAPP and/or 
isoprenoids) can be enhanced or reduced, as compared to an untransformed host 
cell. Thus, in accordance with an embodiment of the present invention, the 
production or the biochemical activity of the target compounds (or the enzymes 
which catalyze their formation) may be reduced or inhibited by a number of different 
approaches, available to those skilled in the art, including but not limited to such 
methodologies or approaches as anti-sense (e.g., Gray et al., 1992), ribozymes 
(e.g., Wegener et al., 1994), co-suppression (e.g. Fray and Grierson, 1993), 
targeted disruption of the gene (e.g., Schaefer et al., 1997), intracellular antibodies 
(e.g., see Rondon and Marasco, 1997) or whatever other approaches rely on the 
knowledge or availability of the nucleic acid sequences of the invention, or the 

proteins encoded thereby. 

Host systems according to the present invention can comprise any organism 
that utilizes a nonmevalonate (i.e., via DXP) pathway for production of IPP and/or 
DMAPP. Organisms which produce isoprenoids using IPP and/or DMAPP derived 
from a nonmevalonate pathway include plants, algae, certain bacteria, 
cyanobacteria and other photosynthetic bacteria. Transformation of these hosts 
with vectors according to the present invention can be done using standard 
techniques. See, for example, Sambrook et al., Molecular Cloning A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989; Ausubel 
et al. f Current Pmtnmls in Modular Biology . Greene Publishing and Wiley 
Interscience, New York, 1991. 

Alternatively, transgenic organisms can be constructed which include the 
nucleic acid sequences of the present invention. The incorporation of these 
sequences can allow the controlling of isoprenoid biosynthesis, content, or 
composition in the host cell. These transgenic systems can be constructed to 
incorporate sequences which allow for the overexpression of the various nucleic 
acid sequences of the present invention. Transgenic systems can also be 
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constructed which allow for the underexpression of the various nucleic acid 
sequences of the present invention. Such systems may contain anti-sense 
expression of the nucleic acid sequences of the present invention. Such anti-sense 
expression would result in the accumulation of the substrates of the enzyme 
encoded by the sense strand. 

Having generally described this invention, a further understanding can be 
obtained by reference to certain specific examples which are provided herein for 
purposes of illustration only and are not intended to be limiting unless otherwise 
specified. 

F X AMPLE 1 

Isolation of plant rnNAs a nd bac te ri a l genes that enha nce carotenoid accumulation 
in Escherichia coli 

The plasmid pAC-LYC (Cunningham et al. a 1994 and 1996) contains genes 
encoding all of the enzymes required for the formation of lycopene from IPP and 
DMAPP (see Figure 3). Cells of E.coli containing the plasmid pAC-LYC accumulate 
the carotenoid lycopene and thereby form colonies on solid growth medium that are 
pink in color (Cunningham et al., 1994). When cDNA or genomic libraries of 
plants, cyanobacteria, or bacteria are introduced into the lycopene-accumulating 
£. coli strain (the complete methodology is contained in Cunningham et a/., 1994 
and 1 996), a rare few of the colonies formed after spreading of the cell culture onto 
solid growth medium are much deeper in color than is typical of the vast majority 
of the colonies. The enhancement of color and concomitant carotenoid pigment 
accumulation in E. coli is prima facie evidence for an enhancement of flux into and 
through the pathway or pathways leading to the immediate precursors of 
carotenoids and all other isoprenoid compounds: IPP and its allylic isomer DMAPP. 
Genes encoding subsequent enzymes in the pathway leading to lycopene are 
already present in the multicopy plasmid pAC-LYC (see Figure 3). 

Among the cDNAs and genes isolated with this screening methodology are: 
Tagetes erecta (marigold) cDNAs encoding a homologue of DXP synthase (the 
DNA sequence of the longest is given in Figure 4), an E.coli genomic clone 
containing the DXP synthase (and no other complete open reading: a genomic 
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fragment from the Sph\ site which lies at base 5210 in the sequence listed under 
GenBank U32768 to the Nde\ site at base 8003 in this GenBank sequence was 
found to enhance carotenoid accumulation in E. coli. This region encompasses the 
open reading frame earlier referred to as f620, and now as the DXP synthase), 

5 Arabidopsis thaliana cDNAs encoding a chloroplast isoform (Shih et aL, 1991 and 
1992) of the enzyme GAP dehydrogenase (the sequence of the longest of these 
isolated cDNAs is given in Figure 5) ( and an Adonis palaestina cDNA encoding a 
homologue of the E. coli lytB gene product (Figure 6). The enhancement of 
isoprenoid biosynthesis in E. coli (as indicated by lycopene accumulation) when 

10 cDNAs or genes encoding DXP synthase or GAP dehydrogenase are introduced 
can be understood in the context of the biochemical pathway that has been 
postulated for production of IPP/DMAPP in £. coli (and in other bacteria, 
cyanobacteria and plants). However, at the date of the provisional filing of this 
application, there had been no indications that increased flux through the pathway 

15 to IPP/DMAPP might be obtained by increasing the expression of either of these 
enzymes. A publication after the priority date of this application (Marker and 
Bramley (1999)) shows a salutary effect on isoprenoid accumulation in E. coffin 
which foreign DXP synthase genes are introduced. 

The mechanism of enhancement of isoprenoid production by introduction of 

20 cDNAs encoding LYTB is not yet known. The nonmevalonate pathway leading to 
IPP and/or DMAPP has not yet been elucidated and these cDNAs may encode an 
enzyme subsequent to DXP synthase in the pathway. Alternatively, the 
enhancement of isoprenoid accumulation may involve a mechanism less direct 
(e.g., as for GAP dehydrogenase, involvement in biochemical reactions that utilize 

25 or supply the substrates GAP and pyruvate, or by exerting a regulatory influence 
on isoprenoid pathways). 

FXAMPLE 2 

Arabidopsis th aliana Ta ge t ** *r*cAa and Adonis nataestinn cDNA I ihraries 

A size-fractionated 2-3 kB cDNA library of A. thaliana in lambda ZAPII 
30 (Kieber et al., 1 993) was obtained from the Arabidopsis Biological Resource Center 
at The Ohio State University (stock number CD4-15). Other size fractionated 
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libraries were also obtained (stock numbers CD4-13, CD4-14, and CD4-16). The 
cDNA libraries for Tagetes erects (marigold) and Adonis palaestina (pheasant's 
eye) were constructed for us by Stratagene using cDNAs isolated from flower 
tissues. An aliquot of each library was treated to cause a mass excision of the 

5 cDNAs and thereby produce a phagemid library according to the instructions 
provided by the supplier of the cloning vector (Stratagene; E coli strain XL1-Blue 
and the helper phage R408 were used). The titre of the excised phagemid was 
determined and the library was introduced into a lycopene-accumulating strain of 
E coli TOP10 F (this strain contained the plasmid pAC-LYC) by incubation of the 

1 0 phagemid with the E coli cells for 1 5 min at 37°C. Cells had been grown overnight 
at 30°C in LB medium supplemented with 2% (w/v) maltose and 10 mM MgS0 4 
(final concentration), and harvested in 1.5 ml microfuge tubes at a setting of 3 on 
an Eppendorf microfuge (5415C) for 10 min. The pellets were resuspended in 10 
mM MgS0 4 to a volume equal to one-half that of the initial culture volume. 

15 Transformants were spread on large (150 mm diameter) LB agar petri plates 
containing antibiotics to provide for selection of cDNA clones (ampicillin) and 
maintenance of pAC-LYC (chloramphenicol). Approximately 10,000 colony forming 
units were spread on each plate. Petri plates were incubated at room temperature 
for 2 to 7 days to allow maximum color development. Plates were screened visually 

20 with the aid of an illuminated 3x magnifier and a low power stage-dissecting 
microscope for the rare deep pink colonies that could be observed in the 
background of paler pink colonies. 

pyAMPi f 3 

Enhancement nf C.amtenniri Accu mu l at i on in E . COR hV CDNAs and flffflftS encoding 

25 LYTR. DXP synthase ipp frnmemsR an d G AP Deh y drogen ase. Individually and in 

Combination . 

Attempts were made to maximize production of three very different 
carotenoids in E. coli engineered to accumulate these isoprenoid pigments. The 
specific carotenoids chosen were the linear and lipophilic pink compound 
30 lycopene, the bicyclic hydrocarbon (J-carotene and the much more polar dihydroxy, 
bicyclic carotenoid zeaxanthin. DXP synthase, IPP isomerase, LYTB, and GAP 
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dehydrogenase had been observed to enhance the pigmentation of E coli colonies 
growing on solid agar medium. For a more quantitative appraisal of the influence 
of these genes and cDNAs on carotenoid accumulation, the amounts of carotenoid . 
pigments in liquid cultures were examined with the results given in Figure 1. The 
marigold DXP synthase, an Arabidopsis IPP isomerase and the Adonis LYTB 
cDNAs gave rise to significant and substantial increases in the amount of 
carotenoids accumulated per volume of culture. A partially-sequenced Arabidopsis 
cDNA (GenBank AA605545) with sequence similarity to the Adonis lytB cDNA gave 
results similar to that of the Adonis cDNA (not shown). A synechocystis PCC6803 
lytB gene was less effective but gave a significant enhancement as well. See Table 
1 . GAP dehydrogenase was slightly detrimental to pigment accumlation in liquid 
culture, in contrast to earlier observed enhancement for cultures grown on solid 
media. The influence of this gene on pigment accumulation may depend on the 
specific growth regimen. A similar result, enhancement for cultures on solid media 
but reduction in liquid culture, was also obtained for an £. coli genomic fragment 
containg DXP synthase (see Example 4 below and Figure 8). A combination of 
DXP synthase with IPP isomerase was significantly more effective at enhancing 
pigment accumulation than was either of the individual cDNAs. 

FXAMPLE 4 

Enhancement and Redt ir.tinn nf Lycopene Accumulation in F coli containing 
mult iple copies nf genes encoding DXP synthase and Farnesvl Pyrophosphate 

1 

synthase, respectively 

A number of the inserts in plasmids obtained from dark pink colonies 
selected in screens of the £. coli genomic library (see example 1) contained the 
gene encoding DXP synthase in this organism (see Sprenger et al., 1997; Lois et 
al. f 1998). Deletion mapping of one such £. coli genomic fragment indicated that 
the dark pink colony phenotype and enhanced carotenoid accumulation in liquid 
culture was conferred by a fragment containing the DXP synthase and no other 
complete open reading (Figure 8). A genomic fragment containing the ispA and 
xseB genes immediately upstream of dxps was, in contrast, quite inhibitory for 
carotenoid accumulation (Figure 8). This diminution in pigmentation results, we 
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surmise, from the activity of the farnesyl pyrophosphate (FPP) synthase encoded 
by ispA (Fujisaki et al. v 1990). The FPP synthase would be expected to compete 
with geranylgeranyl pyrophosphate (GGPP) synthase (Figure 1) for IPP and 
DMAPP and thereby deplete the substrate available for carotenoid biosynthesis. 

Table 1 illustrates the influence of cDNAs encoding DXP synthase, GAP 
dehydrogenase, LYTB, and IPP isomerase, individually and in combination, on the 
accumulation of lycppene, p-carotene, or zeaxanthin in three strains of E. coli 



engineered to produce these isoprenoid compounds. 



Table 1. Enhancement of carotenoid accumulation in lycopene, fa- 
carotene, and zeaxanthin-accumulating strains of E. coli by introduction of 
plant cDNAs and cyanobacterial genes individually and in combination. 


Plasmid 


cDNA or 
Gene 
Product 


Lycopene 
% of control 


b-Carotene 
% of control 


Zeaxanthin 
% of control 


pBluescript SK- 


Control (empty 
vector) 


(100) 


(100) 


(100) 


pAfipiTrc 


Arabidopsis 
thaliana IPI 


202 ± 12 


22713 


223 ±4 


pTedxps 


Tagetes erecta 
DXPS 


203 ±11 


193 ±11 


228 ±40 


pAdlytB 


Adonis 

palaestina LYTB 


157 ±9 


180 ±3 


168 ±2 


p6803lytB 


Synechocystis 
PCC 6803 LYTB 


116 ± 11 


111 ±1 


109 ±2 


pAtgapA 


Arabidopsis 
thaliana GAPD 


79±6 


nd 


79 ±3 


Data were obtained with £ constrain TOP10 containing pAC-LYC, pAC-BETA or 
pAC-ZEAX (Cunningham etal., 1994 and 1996, Sun etal., 1996). Carotenoid 
content (per volume of culture) was measured after 48-72 h growth in LB medium. 
Cultures were inoculated with freshly grown individual colonies from agar plates. 
Values are mean ± so for 4 to 13 individual cultures. The empty plasmid pBluescript 
SK* (from Stratagene) served as the control and was used as the vector for all of the 
plant cDNAs and the Synechocystis lytB gene. The influence of GAPD on 
accumulation of b-carotene was not determined (nd). 
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ordinary skill in the art that many changes and modifications can be made thereto 
10 without departing from the spirit or scope of the invention as set forth herein. 
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We claim: 

1 . A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 

5 inserting into the host cell a vector comprising a heterologous nucleic 

acid sequence which encodes for a protein having deoxyxylulose-5-phosphate 
(DXP) synthase enzyme activity, wherein the heterologous nucleic acid sequence 
is operably linked to a promoter, and 

expressing the heterologous nucleic acid sequence, thereby 

10 producing the protein. 

2. The method of claim 1 , wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

3. The method of claim 1 , wherein the isoprenoid comprises at least one 
15 member selected from the group consisting of an essential oil, a fragrance, a 

rubber, a carotenoid and a therapeutic compound. 

4. The method of claim 1 , wherein the isoprenoid comprises a carotenoid. 

5. The method of claim 1, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for dxps. 

20 6. The method of claim 5, wherein the nucleic acid sequence for dxps encodes 
an amino acid sequence which is completely identical to SEQ ID NO: 2. 

7. The method of claim 1, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

8. The method of claim 1 , wherein the heterologous nucleic acid sequence 
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originates from a eukaryotic cell. 

9. The method of claim 1, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

10. A method of modifying the production of isopentenyl pyrophosphate (IPP). 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 
method comprising 

inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein which modifies deoxyxylulose-5- 
phosphate (DXP) synthase enzyme activity in the host cell, relative to an 
untransformed host cell, wherein the heterologous nucleic acid sequence is 
operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby 
producing the protein. 

1 1 . The method of claim 1 0, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

12. The method of claim 10, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

13. The method of claim 10, wherein the isoprenoid comprises a carotenoid. 

14. The method of claim 10, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

15. The method of claim 10, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 
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16. The method of claim 10, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

17. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having deoxyxylulose-5-phosphate (DXP) synthase 

5 enzyme activity, the method comprising inserting into the host cell a vector 
comprising the heterologous nucleic acid sequence, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter, and expressing the 
heterologous nucleic acid sequence. 

18. The method of claim 17, wherein the heterologous nucleic acid sequence 
10 comprises a nucleic acid sequence for dxps. 

19. The method of claim 18, wherein the nucleic acid sequence for dxps 
encodes an amino acid sequence which is completely identical to SEQ ID NO: 2. 

20. The method of claim 17, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

15 21. The method of claim 17, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

22. The method of claim 17, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

23. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
20 which encodes for a protein which modifies deoxyxylulose-5-phosphate (DXP) 

synthase enzyme activity in the host cell, relative to an untransformed host cell, the 
method comprising inserting into the host cell a vector comprising the heterologous 
nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably 
linked to a promoter, and expressing the heterologous nucleic acid sequence. 

4 
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24. The method of claim 23, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

25. The method of claim 23, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

26. The method of claim 23, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

27. A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity, wherein the heterologous nucleic acid sequence 
is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 

protein. 

28. The method of claim 27, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

29. The method of claim 27, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

30. The method of claim 27, wherein the isoprenoid comprises a carotenoid. 

31. The method of claim 27, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for gapd. 

* * 
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32. The method of claim 31, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 4. 

33. The method of claim 27, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

34. The method of claim 27, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

35. The method of claim 27, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell 

36. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 

method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein which modifies glyceraldehyde-3-phosphate 
(GAP) dehydrogenase enzyme activity in the host cell, relative to an untransformed 
host cell, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter, and 

expressing the heterologous nucleic acid sequence, thereby producing the 
protein. 

37. The method of claim 36, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

38. The method of claim 36, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

39. The method of claim 36, wherein the isoprenoid comprises a carotenoid. 
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40. The method of claim 36, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

41 . The method of claim 36, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

5 42. The method of claim 36, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

43. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity, the method comprising inserting into the host cell 

10 a vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

44. The method of claim 43, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for gapd. 

15 '45. The method of claim 44, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 4. 

46. The method of claim 43, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

47. The method of claim 43, wherein the heterologous nucleic acid sequence 
20 originates from a eukaryotic cell. 

48. The method of claim 43, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

49. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
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which encodes for a protein which modifies glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity in the host cell, relative to an untransformed host 
cell, the method comprising inserting into the host cell a vector comprising the 
heterologous nucleic acid sequence, wherein the heterologous nucleic acid 
5 sequence is operably linked to a promoter, and expressing the heterologous nudeic 
acid sequence. 

50. The method of claim 49, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell 

51 . The method of claim 49, wherein the heterologous nucleic acid sequence 
10 originates from a eukaryotic cell. 

52. The method of claim 49, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

53. A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 

15 to an untransformed host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 
20 protein. 

54. The method of claim 53, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

55. The method of claim 53, wherein the isoprenoid comprises at least one 
25 member selected from the group consisting of an essential oil, a fragrance, a 

rubber, a carotenoid and a therapeutic compound. 
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56. The method of claim 53, wherein the isoprenoid comprises a carotenoid. 

57. The method of claim 53, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for lytB. 

58. The method of claim 57, wherein the nucleic acid sequence encodes an 
5 amino acid sequence which is completely identical to SEQ ID NO: 6. 

59. The method of claim 53, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

60. The method of claim 53, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

10 61. The method of claim 53, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

62. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 
method comprising 

15 inserting into the host cell a vector comprising a heterologous nucleic acid 

sequence which encodes for a protein which modifies LYTB activity in the host cell, 
relative to an untransformed host cell, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 

20 protein. 

63. The method of claim 62, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
famesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

64. The method of claim 62, wherein the isoprenoid comprises at least one 
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member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

65. The method of claim 62, wherein the isoprenoid comprises a carotenoid. 

66. The method of claim 62, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

67. The method of claim 62, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

68. The method of claim 62, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

69. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having LYTB activity, the method comprising inserting 
into the host cell a vector comprising the heterologous nucleic acid sequence, 
wherein the heterologous nucleic acid sequence is operably linked to a promoter, 
and expressing the heterologous nucleic acid sequence. 

70. The method of claim 69, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for lytB. 

71. The method of claim 69, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 6. 

72. The method of claim 69, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

73. The method of claim 69, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 
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74. The method of claim 69, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

75. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein which modifies LYTB activity in the host cell, relative 
to an untransformed host cell, the method comprising inserting into the host cell a 
vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 

* 

expressing the heterologous nucleic acid sequence. 



10 



76. The method of claim 75, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 



77. The method of claim 75, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

78. The method of claim 75, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 
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Figure 5A: SEQ ID NO: 1 ^ 3 

Tagetes erecta (marigold) cDNA for DXP synthase 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
€01 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 



ggcacgagaa 
acaaatggcc 
aaaatggcta 
atgccatcaa 
acatgcaacc 
caaccaccct 



aaaattagct 
caggcggtca 
ctccatcatg 
tcatcaggct 
gaacgattcg 
agtaatcatg 
aggttragga 
taattgcagt 
gccatgaaca 
tgacaacagg 
cgcctgttgg 
aagttrcgcc 
ggataaaaca 
tggttggtgg 
gttggtccgg 
taagattaag 
ccgaaaaagg 
catggtgttg 



aagcgaaaga 
ggcacaggcc 
tgttggcata 
cagaagggtt 
ggttatgatc 
atttgctatg 
gtggtgcatt 
atggctcctt 
agccatcgat 
taggctccat 
actggaaggg 



gcactctcat 
ttgtgtggtg 
cacttcctca 
acaagcgaaa 
aatgaacatg 
caagtattct 
atccaattca 
gatgaactcc 
cttgagttcg 
ttttcaacac 
tatccacaca 
acaaacgttr 
atgctitttigg 
atggccgccg 
gatcggzgac 
atgcagcrra 
cagrrctcrr 
cgcactcagc 
aacracgcga 
catgaaczzg 
tcaaggcgct 
tagatggaca 
tccatgacgg 
caaaggtcac 
tgaaatttga 
ctctcatata 
ggatgacaag 
ttaacacatt 
gctgagcaac 
gaaaccattt 
aagtggttca 
gatagggctg 
tgatacaaca 
catgcgaggc 
gaccgtccta 
actaccagca 
tgattaaaga 



attctcttgt 
ctttgaaggg 
agtttgctta 
gtttttggga 
aggacctgac 
ggggataaac 
catgaaaaac 
gagaagaaac 
agcttggggg 
tccagaagac 
aaatattgac 
ggacttgcag 
tgcaggccac 
gtagagactr 
ggagcgatca 
cccagattcr 
tgcccacagc 
cgatccctca 
agccgcaaag 
cggcaaaaac 
tctatgtctg 
caatcttgaa 
ctccgggacc 
ccgcccgctg 
tactcaaacc 
cccaatattt 
atagtcgcga 
ccaaaaagag 
acgcaatcac 
tgtgctattt 
tgatgttgat 
gtttggtagg 
ttcatggctt 
agagctcatg 
gctgttttag 
aataacaaag 
gggaaataga 



gtgttttctg 
tgggtttgtg 
atccgtcagc 
atcgtagctg 
aacaatggat 
ccaaaactcc 
ccttgcgtgg 
tgtatatacg 
ttgttgaatc 
aagatcatct 
cggaagaagg 
gcttccctaa 
agctctacca 
gctaggaaag 
ccgcaggaca 
aatcrcatca 
cactattgat 
caaggcttca 
gaagtaacaa 
gcactctcta 
aagagcttgg 
gatcttgtcc 
agtcctagtc 
aagttgctgc 
ggcaaacaaa 
cgtggactct 
ttcatgcggc 
ttccctgcac 
ctctgctgca 
attcatcttt 
cttcaaaagc 
agcagatggt 
gtttaccaaa 
aacatggtgg 
gtacccaaga 
gaaccttaat 
gtggccttat 



aatcacacca 
cccatagctc 
caatgcaatc 
tttcaaaaga 



gatcttggat 
aggagctcgt 
gtatcaaaga 
aacagtttca 
gggacgttgg 
tcgagaatga 
gagagacgag 
gcattcctgc 
aacaatcatg 
agcatatgag 
tagtcttaaa 
ggccctgcgc 



agcaattagg 
gtgaaaggaa 
cctctattat 
atgttttcga 
cacattgtta 
agataaaatg 
agaaaaataa 
ttggttgctg 
tatgggcgga 
gtcgttttga 
ggtttagcaa 
cctacaaaga 
ttcctgttag 
ccaactcacc 
cacggtggtt 
ccacagctgt 



cgaggttgga 
tgggatatgg 
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FIGURE 5A (cont.) 
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aactatagtt 
gcatctctgt 
aatctaatca 
agaaggctct 



caaagttgct 
gaccgtggcg 
aacaactagc 
ataggaggac 



tagcagcaag 
gatgcacgat 
aaacgagcat 
ttagctctca 



tgaagttctt 
tctgcaagcc 
gaggtcctca 
cgtctctcat 



gatcgatata 
tcttagctca 
gcaaagagac 
agatactata 
tatatatata 
tattgtactt 
aatctcgtcc 



trgaacatgg 
aagcatattg 
actacatgca 
aaagacaatg 
tatatatata 



ggctcaatct 
cagcaacagt 
ctcaacgcat 
cttcgctgga 
tatatatatt 



ttcttgtaac atacatggtc 



gaccagattg 
tttatcattg 
aaattgaaat 
ctcttttgat 
gtagttttcg 
gcgttgataa 



aaaaaaattg 
gcttgatgga 
ccacagttga 
ttcctggcct 
gatgttgcct 
aggaggctgg 
attgggggca 
atgtcatgct 
catatatata 
atcatataca 
ttgttgacat 
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Figure 5B: SEQ ID NO: 2 7^3 

PREDICTED AMINO ACID SEQUENCE of Tagetes erects (marigold) cDNA for 
DXP synthase 

TRANSLATION from: 55 to: 2232 Length: 726 

1 MALCGALKGG FVPIAQNGYT SSSLLNPSAN AIMPSNKRKF LGIVAVSKEH 

51 ATNEKEDLTT MDKTTSTTLK YSGDKPKTPI LDTINYPIHM KNLCVEELVK 

101 LADELREEIV YTVSKTGGHL SSSLGWELT VSLHHVFNTP EDKIIWDVGH 

151 QAYPHKILTG RRSRMRTIRQ TFGLAGFPKR DESNHDAFGA GHSSTSISAG 

201 LGMAVGRDLL GKNNHVTAVT GDGAMTAGQA YEAMNNAGYL DSNLIIVLND 

251 NRQFSLPTAT IDGPAPPVGA LSRSLTRLQT SQKFRQLREA AKEVTKQLGD 

301 KTHEVAAKMD SLVKGMVGGQ GASMFEELGL YYVGPVDGHN LEDLVYVFDK 

351 IKSMTAPGPV LVHIVTEKGK GYPPAEVAAD KMHGWKFDT QTGKQKKNKT 

401 KTLSYTQYFV DSLVAEAKED DKIVAIHAAM GGGTGLNTFQ KEFPARCFDV 

451 GIAEQKAITF AAGLATEGLK PFCAIYSSFL QRGYDQWHD VDLQKLPVRF 

501 AMDRAGLVGA DGPTHCGAFD TTFMACLPNM WMAPSCEAE LMNMVATAVA 

551 IDDRPSCFRY PRGNGIGSIL PANNKGTLIE VGTGRVIKEG NRVALLGYGT 

601 IVQSCLAASE VLKKIGISVT VADARFCKPL DGNLIKQLAN EHEVL1TVEE 

651 GSIGGFSSHV SErLALNGLL DGHLKWRAMM LPDRYIZHGA QSDQIEEAGL 

701 SS KHIAATVL SLIGGSKZTL HALNV* 
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Figure 6A: SEQ ID NO: 3 8/13 

Arabidopsis thaliana cDNA for a chloroplasts isoform of gIycera!dehyde-3- 
phosphate (GAP) dehydrogenase. 
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Figure 6B: SEQ!DNO:4 g^j 

Predicted amino acid sequence of Arabidopsis thaliana cDNA for a chloroplasts 
isoform of glyceraldehyde-3-phosphate (GAP) dehydrogenase. 

1 VPKGFTEFSG LRSSSASLPF GKKLSSDEFV SIVSFQTSAM GSSGGYRKGV 

51 TEAKLKVAIN GFGRIGRNFL RCWHGRKDSP LDIIAINDTG GVKQASHLLK 

101 YDSTLGIFDA DVKPSGETAI SVDGKIIQW SNRNPSLLPW KELGIDIVIE 

151 GTGVFVDREG AGKHIEAGAX KVIITAPGKG DIPTYWGVN ADAYSHDEPI 

201 ISNASCTTNC LAPFVKVLDQ KFGIIKGTMT TTHSYTGDQR LLDASHRDLR 

251 RARAAALNIV PTSTGAAKAV ALVLPNLKGK LNGIALRVPT PNVSWDLW 

301 QVSKKTFAEE VNAAFRDSAE KELKGILDVC DEPLVSVDFR CSDFSTTIDS 

351 SLTMVMGDDM VKVIAWYDNE WGYSQRWDL ADIVANNWK* FHQTLFHVFP 

401 FFSLSLFYIM VLKISVKTV 
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Figure 7A: SEQ ID NO: 5 
Adonis palaestina cDNA 
Length: 1619 June 4, 1998 
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1 ccttcgtctc 

51 cgtttctcca 

101 tcgcctctac 

151 gaccttcagg 

201 tctttccgcc 

251 cggccacaaa 

3 01 atgttattaa 

351 gttacagtga 

401 agctgttcag 

451 tttggattac 

501 gaagagatgg 

551 tgatgttgtt 

601 ccgtaagtga 

651 acaacttgcc 

701 gaagggagag 

751 ctattgcaac 

801 gatgaggcaa 

851 atccagctca 

901 ccgaaggatt 

951 caaactacaa 

1001 aaagactatg 

1051 taagctttaa 

1101 tttaagttgg 

1151 gaattcaagt 

1201 ttccttctta 

1251 attagccaca 

1301 cccagagggt 

1351 ataaggctgt 

1401 caattgttgc 

1451 caccactgtt 

1501 atatagaaac 

1551 ccataatgat 
1601 



tgcacttctt 
ccccatctga 
cgatccaaga 
ctcctcctcc 
acaaccttac 
gatgagactc 
gaagctgaag 
aattggctga 
actgcttatg 
taatgagatt 
aagttaaaga 
gcaaaggctg 
aatg^wgact 
catgggtatc 
tiacacaccaa tz 
tgcatcttrt 
tgtatgcgcg 
gacaaacacg 

tgatccagac 
tgttgaaccg 

atgcgcaagt 
cacaatatgt 
tagaggagaa 
aacacctcac 
ctggattgac 
aattgaatca 
cctgttacaa 
tgaagatgct 
agctagcata 
caatatgtgt 
tgtaatacct 
gtaaattttt 



tcttccatgg cgacttctct 
tctctccttc cctgaaaccc 
aaccattttc cgtccgttgt 
acggcagtgg aatccgaatt 
aagaagcaag aactacaacc 
ttgagcttat gaacagcgag 
gagaatggaa acgagtacag 
gtcgtatggg ttctgttggg 
aagctagaaa acagtttccc 
atccataatc caactgttaa 
gattccagtt gggga t ggga 
acgctgtgat tttgccggct 
trgagcgaca aaaatgtcca 
taacgtgtgg aattccgttg 
ccatrcacgg taaatattcc 
gctgggaact atatcattgc 
cgaczacacr ctcggaggtg 
cacttttgca gaaattcaaa 
acagatctta ttaaaacagg 
agaaacagaa gatattggta* 
atggagttga gaatattaat 
gatgctacac aagagagaca 
agtggatctc atattagtag 
atctacaaga aatttctgag 
agtgagacga gaataggacc 
tggtgagcta gttgagaccg 
ttggtgtgac atcaggtgct 
cttgtaaagg tgtjicgacat 
gagaccagtg ttctgcagag 
gattgtatat gtatcatgga 
tgttatgtta tactatatgg 
agttccatgc aacaatattt 



tcaattctgc 
gctcttccac 



cgatgccaag 
gaagaggttt 
tatacaagtg 
ttggggacct 
gtgtggaaag 
gatgagaaga 
caagagatta 
agaaacattt 
tttggagctg 
gatagtcgac 
agaagcataa 
cacgaggaaa 



agcttaatgg 
tacgcaattt 
cattgcaaac 
aacttctcga 
gaccacttca 
agacgccatg 
ttggtggatg 
ttacgtggca 
aggaaacaag 



tctaccccag 
caagcgcgaa 
atcaaagaac 



atgtccttta 
tcttgaatga 



aaaaaaaaaa aaaaaaaaa 
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Figure 7B: SEQ ID NO: 6 1 m 3 

Predicted amino acid sequence of Adonis palaestina LYTB 

Length: 465 Translate 27-1421 

1 MATSLQFCRF STPSDLSFPE TRSSTRLYRS KKPFSVRCHS EGPSGSSSTA 

51 VESEFDAKSF RHNLTRSKNY NRRGFGHKDE TLELMNSEYT SDVTKKLKEN 

101 GNEYSWGPVT VKLAESYGFC WGVERAVQIA YEARKQFPDE KIWITNEIIH 

151 NPTVNKRLEE MEVKEIPVGD GKKHFDWAK GDWTLPAFG AAVSEMLTLS 

201 EKNVQIVDTT CPWVSKVWNS VEKHKKGETT SIIHGKYSHE ETIATASFAG 

251 KYIIVKNMDE AMYVCDYILG GELNGSSSDK QALLEKFKYA ISEGFDPDTD 

301 LIKTGIANQT TMLKGETEDI GKLLEKXMMR KYGVENINDH FISFNTICDA 

351 TQERQDAMFK LVEEKVDLIL WGGWNSSNT SHLQEISELR GIPSYWIDSE 

401 TRIGPGNKIS HKLNHGELVE TENWLPEGPV TIGVTSGAST PDKAVEDALV 

451 KVFDIKREQL LQLA* 
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FIGURE 8 
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FIGURE 9 
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